Architecture Scalability of Parallel Vector Computers with a Shared Memory
نویسنده
چکیده
Based on a model of a parallel vector computer with a shared memory, its scalability properties are derived. The processor-memory interconnection network is assumed to be composed of crossbar switches of size b b. This paper analyzes sustainable peak performance under optimal conditions, i.e., no memory bank conflicts, sufficient processor-memory bank pathways, and no interconnection network conflicts. It will be shown that, with fully vectorizable algorithms and no communication overhead, the sustainable peak performance does not scale up linearly with the number of processors p. If the interconnection network is unbuffered, the number of memory banks must increase at least with O(p logb p) to sustain peak performance. If the network is buffered, this bottleneck can be alleviated; however, the half performance vector length still increases with O(logb p). The paper confirms the validity of the model by examining the performance behavior of the LINPACK benchmark.
منابع مشابه
Thesis Proposal: Design and Evaluation of the Hamal Parallel Computer
We wish to investigate design principles for general-purpose shared memory computers. Specific areas of interest are silicon efficiency, scalability, and RAM integration. Experiments will be performed using a cycle accurate simulator for the Hamal parallel computer. In this proposal we present an overview of the Hamal architecture and our plans for evaluating various mechanisms.
متن کاملAutomatic Tuning of Whole Applications:
For many years, retargeting of applications for new architectures has been a major headache for high performance computation. As new architectures have emerged at dizzying speed, we have moved from uniprocessors, to vector machines, symmetric multiprocessors, synchronous parallel arrays, distributed-memory parallel computers, and scalable clusters. Over the past year, clusters based on multicor...
متن کاملUsulDSM: A Page-based Recoverable Distributed Shared Memory Project Report
UsulDSM is a page-based recoverable software distributed shared memory system designed for network of computers that don’t have access to a physically shared memory. In this report we describe architecture of the UsulDSM and discuss its design and implementation. We also evaluate its performance through a simple parallel application that uses UsulDSM. We also analyze UsulDSM’s scalability and t...
متن کاملParallization of Stellar Atmosphere Codes
Parallel computing has turned out to be the enabling technology to solve complex physical systems. However, the transition from shared memory, vector computers to massively parallel, distributed memory systems and, recently, to hybrid systems poses new challenges to the scientist. We want to present a cook-book (with a very strong, personal bias) based on our experience with parallization of ou...
متن کاملShared-Memory Vector Systems Compared
The NEC SX-5 and the Cray SV1 are the only shared-memory vector computers currently being marketed. This compares with at least five models a few years ago (J90, T90, SX-4, Fujitsu and Hitachi), with IBM, Digital, Convex, CDC and others having fallen by the wayside in the early 1990s. In this presentation, some comparisons will be made between the architecture of the survivors, and some perform...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Computers
دوره 47 شماره
صفحات -
تاریخ انتشار 1998